Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 2460 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 410.5 KiB |
| Average record size in memory | 170.9 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 9 |
year has constant value "2016" | Constant |
date has a high cardinality: 203 distinct values | High cardinality |
start_time has a high cardinality: 207 distinct values | High cardinality |
away_team_hits is highly overall correlated with away_team_runs | High correlation |
away_team_runs is highly overall correlated with away_team_hits | High correlation |
home_team_hits is highly overall correlated with home_team_runs | High correlation |
home_team_runs is highly overall correlated with home_team_hits | High correlation |
game_type is highly overall correlated with week_day | High correlation |
home_team is highly overall correlated with venue and 1 other fields | High correlation |
venue is highly overall correlated with home_team and 1 other fields | High correlation |
week_day is highly overall correlated with game_type | High correlation |
fiedl_type is highly overall correlated with home_team and 1 other fields | High correlation |
fiedl_type is highly imbalanced (64.2%) | Imbalance |
away_team is uniformly distributed | Uniform |
home_team is uniformly distributed | Uniform |
away_team_errors has 1407 (57.2%) zeros | Zeros |
away_team_runs has 156 (6.3%) zeros | Zeros |
home_team_errors has 1416 (57.6%) zeros | Zeros |
home_team_runs has 130 (5.3%) zeros | Zeros |
Reproduction
| Analysis started | 2023-02-04 00:51:04.342784 |
|---|---|
| Analysis finished | 2023-02-04 00:51:16.572065 |
| Duration | 12.23 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
attendance
Real number (ℝ)
| Distinct | 2374 |
|---|---|
| Distinct (%) | 96.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30370.704 |
| Minimum | 8766 |
|---|---|
| Maximum | 54449 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.4 KiB |
Quantile statistics
| Minimum | 8766 |
|---|---|
| 5-th percentile | 13971.15 |
| Q1 | 22432 |
| median | 30604.5 |
| Q3 | 38396.25 |
| 95-th percentile | 45624.1 |
| Maximum | 54449 |
| Range | 45683 |
| Interquartile range (IQR) | 15964.25 |
Descriptive statistics
| Standard deviation | 9875.4667 |
|---|---|
| Coefficient of variation (CV) | 0.32516424 |
| Kurtosis | -0.90285907 |
| Mean | 30370.704 |
| Median Absolute Deviation (MAD) | 8006 |
| Skewness | -0.052483633 |
| Sum | 74711931 |
| Variance | 97524843 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 27631 | 3 | 0.1% |
| 41210 | 2 | 0.1% |
| 41850 | 2 | 0.1% |
| 13481 | 2 | 0.1% |
| 44317 | 2 | 0.1% |
| 34294 | 2 | 0.1% |
| 39691 | 2 | 0.1% |
| 36544 | 2 | 0.1% |
| 22230 | 2 | 0.1% |
| 26087 | 2 | 0.1% |
| Other values (2364) | 2439 |
| Value | Count | Frequency (%) |
| 8766 | 1 | |
| 9393 | 1 | |
| 9890 | 1 | |
| 10068 | 1 | |
| 10072 | 1 | |
| 10114 | 1 | |
| 10115 | 1 | |
| 10117 | 1 | |
| 10251 | 1 | |
| 10283 | 1 |
| Value | Count | Frequency (%) |
| 54449 | 2 | |
| 54269 | 1 | |
| 53901 | 1 | |
| 53621 | 1 | |
| 53449 | 1 | |
| 53409 | 1 | |
| 53299 | 1 | |
| 53297 | 1 | |
| 53279 | 1 | |
| 52728 | 1 |
away_team
Categorical
| Distinct | 30 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 KiB |
| Chicago Cubs | 90 |
|---|---|
| Los Angeles Dodgers | 87 |
| Cleveland Indians | 86 |
| Toronto Blue Jays | 85 |
| San Francisco Giants | 84 |
| Other values (25) |
Length
| Max length | 29 |
|---|---|
| Median length | 19 |
| Mean length | 16.694309 |
| Min length | 12 |
Characters and Unicode
| Total characters | 41068 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York Mets |
|---|---|
| 2nd row | Philadelphia Phillies |
| 3rd row | Minnesota Twins |
| 4th row | Washington Nationals |
| 5th row | Colorado Rockies |
Common Values
| Value | Count | Frequency (%) |
| Chicago Cubs | 90 | 3.7% |
| Los Angeles Dodgers | 87 | 3.5% |
| Cleveland Indians | 86 | 3.5% |
| Toronto Blue Jays | 85 | 3.5% |
| San Francisco Giants | 84 | 3.4% |
| Boston Red Sox | 83 | 3.4% |
| Washington Nationals | 83 | 3.4% |
| Baltimore Orioles | 82 | 3.3% |
| Texas Rangers | 82 | 3.3% |
| Cincinnati Reds | 81 | 3.3% |
| Other values (20) | 1617 |
Length
| Value | Count | Frequency (%) |
| chicago | 171 | 2.8% |
| angeles | 168 | 2.8% |
| los | 168 | 2.8% |
| san | 165 | 2.7% |
| sox | 164 | 2.7% |
| new | 161 | 2.7% |
| york | 161 | 2.7% |
| cubs | 90 | 1.5% |
| dodgers | 87 | 1.4% |
| cleveland | 86 | 1.4% |
| Other values (57) | 4646 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3762 | 9.2% |
| 3607 | 8.8% | |
| s | 3530 | 8.6% |
| e | 3275 | 8.0% |
| i | 3021 | 7.4% |
| o | 2800 | 6.8% |
| n | 2714 | 6.6% |
| t | 2035 | 5.0% |
| r | 1876 | 4.6% |
| l | 1804 | 4.4% |
| Other values (36) | 12644 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31395 | |
| Uppercase Letter | 5986 | 14.6% |
| Space Separator | 3607 | 8.8% |
| Other Punctuation | 80 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3762 | |
| s | 3530 | |
| e | 3275 | |
| i | 3021 | |
| o | 2800 | |
| n | 2714 | |
| t | 2035 | 6.5% |
| r | 1876 | 6.0% |
| l | 1804 | 5.7% |
| g | 915 | 2.9% |
| Other values (14) | 5663 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 670 | |
| A | 653 | |
| B | 492 | 8.2% |
| S | 490 | 8.2% |
| R | 489 | 8.2% |
| M | 485 | 8.1% |
| T | 410 | 6.8% |
| P | 405 | 6.8% |
| D | 330 | 5.5% |
| L | 248 | 4.1% |
| Other values (10) | 1314 |
Space Separator
| Value | Count | Frequency (%) |
| 3607 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 80 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37381 | |
| Common | 3687 | 9.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3762 | 10.1% |
| s | 3530 | 9.4% |
| e | 3275 | 8.8% |
| i | 3021 | 8.1% |
| o | 2800 | 7.5% |
| n | 2714 | 7.3% |
| t | 2035 | 5.4% |
| r | 1876 | 5.0% |
| l | 1804 | 4.8% |
| g | 915 | 2.4% |
| Other values (34) | 11649 |
Common
| Value | Count | Frequency (%) |
| 3607 | ||
| . | 80 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41068 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3762 | 9.2% |
| 3607 | 8.8% | |
| s | 3530 | 8.6% |
| e | 3275 | 8.0% |
| i | 3021 | 7.4% |
| o | 2800 | 6.8% |
| n | 2714 | 6.6% |
| t | 2035 | 5.0% |
| r | 1876 | 4.6% |
| l | 1804 | 4.4% |
| Other values (36) | 12644 |
away_team_errors
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5800813 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 1407 |
| Zeros (%) | 57.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.79322671 |
|---|---|
| Coefficient of variation (CV) | 1.3674406 |
| Kurtosis | 2.1811828 |
| Mean | 0.5800813 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4582673 |
| Sum | 1427 |
| Variance | 0.62920861 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1407 | |
| 1 | 765 | |
| 2 | 215 | 8.7% |
| 3 | 61 | 2.5% |
| 4 | 11 | 0.4% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1407 | |
| 1 | 765 | |
| 2 | 215 | 8.7% |
| 3 | 61 | 2.5% |
| 4 | 11 | 0.4% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 11 | 0.4% |
| 3 | 61 | 2.5% |
| 2 | 215 | 8.7% |
| 1 | 765 | |
| 0 | 1407 |
away_team_hits
Real number (ℝ)
| Distinct | 22 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.7670732 |
| Minimum | 1 |
|---|---|
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 6 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 15 |
| Maximum | 22 |
| Range | 21 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.5126873 |
|---|---|
| Coefficient of variation (CV) | 0.40066819 |
| Kurtosis | 0.13926584 |
| Mean | 8.7670732 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.51233243 |
| Sum | 21567 |
| Variance | 12.338972 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 293 | |
| 7 | 287 | |
| 8 | 275 | |
| 10 | 238 | |
| 6 | 225 | |
| 5 | 197 | |
| 11 | 194 | |
| 4 | 135 | 5.5% |
| 12 | 132 | 5.4% |
| 13 | 113 | 4.6% |
| Other values (12) | 371 |
| Value | Count | Frequency (%) |
| 1 | 7 | 0.3% |
| 2 | 26 | 1.1% |
| 3 | 83 | 3.4% |
| 4 | 135 | |
| 5 | 197 | |
| 6 | 225 | |
| 7 | 287 | |
| 8 | 275 | |
| 9 | 293 | |
| 10 | 238 |
| Value | Count | Frequency (%) |
| 22 | 4 | 0.2% |
| 21 | 1 | < 0.1% |
| 20 | 2 | 0.1% |
| 19 | 12 | 0.5% |
| 18 | 15 | 0.6% |
| 17 | 28 | 1.1% |
| 16 | 40 | 1.6% |
| 15 | 66 | |
| 14 | 87 | |
| 13 | 113 |
away_team_runs
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4150407 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 156 |
| Zeros (%) | 6.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.1053905 |
|---|---|
| Coefficient of variation (CV) | 0.70336622 |
| Kurtosis | 1.0089307 |
| Mean | 4.4150407 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.93889878 |
| Sum | 10861 |
| Variance | 9.64345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 347 | |
| 2 | 340 | |
| 4 | 308 | |
| 5 | 272 | |
| 1 | 265 | |
| 6 | 217 | |
| 7 | 179 | |
| 0 | 156 | |
| 8 | 115 | 4.7% |
| 9 | 85 | 3.5% |
| Other values (10) | 176 |
| Value | Count | Frequency (%) |
| 0 | 156 | |
| 1 | 265 | |
| 2 | 340 | |
| 3 | 347 | |
| 4 | 308 | |
| 5 | 272 | |
| 6 | 217 | |
| 7 | 179 | |
| 8 | 115 | 4.7% |
| 9 | 85 | 3.5% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 5 | 0.2% |
| 15 | 10 | 0.4% |
| 14 | 7 | 0.3% |
| 13 | 21 | 0.9% |
| 12 | 25 | 1.0% |
| 11 | 36 | |
| 10 | 69 |
date
Categorical
| Distinct | 203 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 KiB |
| June 25 | 16 |
|---|---|
| May 11 | 16 |
| September 17 | 16 |
| August 16 | 16 |
| May 7 | 16 |
| Other values (198) |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 7.9321138 |
| Min length | 5 |
Characters and Unicode
| Total characters | 19513 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 16 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | April 3 |
|---|---|
| 2nd row | April 6 |
| 3rd row | April 6 |
| 4th row | April 6 |
| 5th row | April 6 |
Common Values
| Value | Count | Frequency (%) |
| June 25 | 16 | 0.7% |
| May 11 | 16 | 0.7% |
| September 17 | 16 | 0.7% |
| August 16 | 16 | 0.7% |
| May 7 | 16 | 0.7% |
| May 18 | 16 | 0.7% |
| July 20 | 16 | 0.7% |
| May 14 | 16 | 0.7% |
| August 31 | 16 | 0.7% |
| September 14 | 15 | 0.6% |
| Other values (193) | 2301 |
Length
| Value | Count | Frequency (%) |
| august | 424 | 8.6% |
| may | 423 | 8.6% |
| september | 408 | 8.3% |
| june | 406 | 8.3% |
| july | 380 | 7.7% |
| april | 354 | 7.2% |
| 17 | 91 | 1.8% |
| 24 | 90 | 1.8% |
| 10 | 89 | 1.8% |
| 20 | 87 | 1.8% |
| Other values (29) | 2168 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2460 | 12.6% | |
| e | 1697 | 8.7% |
| u | 1634 | 8.4% |
| 1 | 1053 | 5.4% |
| 2 | 1050 | 5.4% |
| t | 895 | 4.6% |
| r | 827 | 4.2% |
| y | 803 | 4.1% |
| J | 786 | 4.0% |
| A | 778 | 4.0% |
| Other values (24) | 7530 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10396 | |
| Decimal Number | 4197 | |
| Space Separator | 2460 | 12.6% |
| Uppercase Letter | 2460 | 12.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1697 | |
| u | 1634 | |
| t | 895 | |
| r | 827 | |
| y | 803 | |
| p | 762 | |
| l | 734 | |
| b | 473 | 4.5% |
| g | 424 | 4.1% |
| s | 424 | 4.1% |
| Other values (7) | 1723 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1053 | |
| 2 | 1050 | |
| 3 | 360 | 8.6% |
| 0 | 261 | 6.2% |
| 7 | 260 | 6.2% |
| 9 | 248 | 5.9% |
| 4 | 247 | 5.9% |
| 5 | 246 | 5.9% |
| 6 | 241 | 5.7% |
| 8 | 231 | 5.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 786 | |
| A | 778 | |
| M | 423 | |
| S | 408 | |
| O | 63 | 2.6% |
| N | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2460 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12856 | |
| Common | 6657 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1697 | |
| u | 1634 | |
| t | 895 | 7.0% |
| r | 827 | 6.4% |
| y | 803 | 6.2% |
| J | 786 | 6.1% |
| A | 778 | 6.1% |
| p | 762 | 5.9% |
| l | 734 | 5.7% |
| b | 473 | 3.7% |
| Other values (13) | 3467 |
Common
| Value | Count | Frequency (%) |
| 2460 | ||
| 1 | 1053 | |
| 2 | 1050 | |
| 3 | 360 | 5.4% |
| 0 | 261 | 3.9% |
| 7 | 260 | 3.9% |
| 9 | 248 | 3.7% |
| 4 | 247 | 3.7% |
| 5 | 246 | 3.7% |
| 6 | 241 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19513 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2460 | 12.6% | |
| e | 1697 | 8.7% |
| u | 1634 | 8.4% |
| 1 | 1053 | 5.4% |
| 2 | 1050 | 5.4% |
| t | 895 | 4.6% |
| r | 827 | 4.2% |
| y | 803 | 4.1% |
| J | 786 | 4.0% |
| A | 778 | 4.0% |
| Other values (24) | 7530 |
game_duration
Real number (ℝ)
| Distinct | 168 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 288.63333 |
| Minimum | 115 |
|---|---|
| Maximum | 613 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.4 KiB |
Quantile statistics
| Minimum | 115 |
|---|---|
| 5-th percentile | 229 |
| Q1 | 247 |
| median | 302 |
| Q3 | 319 |
| 95-th percentile | 352 |
| Maximum | 613 |
| Range | 498 |
| Interquartile range (IQR) | 72 |
Descriptive statistics
| Standard deviation | 49.449673 |
|---|---|
| Coefficient of variation (CV) | 0.1713235 |
| Kurtosis | 3.3939088 |
| Mean | 288.63333 |
| Median Absolute Deviation (MAD) | 44 |
| Skewness | 1.1491938 |
| Sum | 710038 |
| Variance | 2445.2701 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 255 | 55 | 2.2% |
| 256 | 54 | 2.2% |
| 300 | 53 | 2.2% |
| 254 | 51 | 2.1% |
| 308 | 47 | 1.9% |
| 304 | 47 | 1.9% |
| 303 | 46 | 1.9% |
| 305 | 46 | 1.9% |
| 252 | 45 | 1.8% |
| 307 | 45 | 1.8% |
| Other values (158) | 1971 |
| Value | Count | Frequency (%) |
| 115 | 1 | < 0.1% |
| 155 | 1 | < 0.1% |
| 202 | 1 | < 0.1% |
| 206 | 1 | < 0.1% |
| 207 | 1 | < 0.1% |
| 208 | 2 | |
| 210 | 4 | |
| 211 | 3 | |
| 212 | 3 | |
| 213 | 2 |
| Value | Count | Frequency (%) |
| 613 | 1 | |
| 556 | 1 | |
| 548 | 1 | |
| 547 | 1 | |
| 534 | 1 | |
| 526 | 1 | |
| 525 | 2 | |
| 523 | 1 | |
| 518 | 1 | |
| 510 | 1 |
game_type
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 KiB |
| Night Game | |
|---|---|
| Day Game |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.3528455 |
| Min length | 8 |
Characters and Unicode
| Total characters | 23008 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Night Game |
|---|---|
| 2nd row | Night Game |
| 3rd row | Night Game |
| 4th row | Night Game |
| 5th row | Day Game |
Common Values
| Value | Count | Frequency (%) |
| Night Game | 1664 | |
| Day Game | 796 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| game | 2460 | |
| night | 1664 | |
| day | 796 | 16.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3256 | |
| 2460 | ||
| G | 2460 | |
| m | 2460 | |
| e | 2460 | |
| N | 1664 | |
| i | 1664 | |
| g | 1664 | |
| h | 1664 | |
| t | 1664 | |
| Other values (2) | 1592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15628 | |
| Uppercase Letter | 4920 | 21.4% |
| Space Separator | 2460 | 10.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3256 | |
| m | 2460 | |
| e | 2460 | |
| i | 1664 | |
| g | 1664 | |
| h | 1664 | |
| t | 1664 | |
| y | 796 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2460 | |
| N | 1664 | |
| D | 796 | 16.2% |
Space Separator
| Value | Count | Frequency (%) |
| 2460 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20548 | |
| Common | 2460 | 10.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3256 | |
| G | 2460 | |
| m | 2460 | |
| e | 2460 | |
| N | 1664 | |
| i | 1664 | |
| g | 1664 | |
| h | 1664 | |
| t | 1664 | |
| D | 796 | 3.9% |
Common
| Value | Count | Frequency (%) |
| 2460 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23008 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3256 | |
| 2460 | ||
| G | 2460 | |
| m | 2460 | |
| e | 2460 | |
| N | 1664 | |
| i | 1664 | |
| g | 1664 | |
| h | 1664 | |
| t | 1664 | |
| Other values (2) | 1592 |
home_team
Categorical
HIGH CORRELATION  UNIFORM 
| Distinct | 30 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 KiB |
| Cleveland Indians | 89 |
|---|---|
| Chicago Cubs | 89 |
| Los Angeles Dodgers | 86 |
| Toronto Blue Jays | 86 |
| Washington Nationals | 84 |
| Other values (25) |
Length
| Max length | 29 |
|---|---|
| Median length | 19 |
| Mean length | 16.695528 |
| Min length | 12 |
Characters and Unicode
| Total characters | 41071 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Kansas City Royals |
|---|---|
| 2nd row | Cincinnati Reds |
| 3rd row | Baltimore Orioles |
| 4th row | Atlanta Braves |
| 5th row | Arizona Diamondbacks |
Common Values
| Value | Count | Frequency (%) |
| Cleveland Indians | 89 | 3.6% |
| Chicago Cubs | 89 | 3.6% |
| Los Angeles Dodgers | 86 | 3.5% |
| Toronto Blue Jays | 86 | 3.5% |
| Washington Nationals | 84 | 3.4% |
| Texas Rangers | 83 | 3.4% |
| San Francisco Giants | 83 | 3.4% |
| Boston Red Sox | 82 | 3.3% |
| Kansas City Royals | 81 | 3.3% |
| Cincinnati Reds | 81 | 3.3% |
| Other values (20) | 1616 |
Length
| Value | Count | Frequency (%) |
| chicago | 169 | 2.8% |
| los | 167 | 2.8% |
| angeles | 167 | 2.8% |
| san | 164 | 2.7% |
| sox | 162 | 2.7% |
| new | 162 | 2.7% |
| york | 162 | 2.7% |
| indians | 89 | 1.5% |
| cleveland | 89 | 1.5% |
| cubs | 89 | 1.5% |
| Other values (57) | 4646 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3770 | 9.2% |
| 3606 | 8.8% | |
| s | 3530 | 8.6% |
| e | 3277 | 8.0% |
| i | 3014 | 7.3% |
| o | 2795 | 6.8% |
| n | 2724 | 6.6% |
| t | 2033 | 4.9% |
| r | 1872 | 4.6% |
| l | 1810 | 4.4% |
| Other values (36) | 12640 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31399 | |
| Uppercase Letter | 5985 | 14.6% |
| Space Separator | 3606 | 8.8% |
| Other Punctuation | 81 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3770 | |
| s | 3530 | |
| e | 3277 | |
| i | 3014 | |
| o | 2795 | |
| n | 2724 | |
| t | 2033 | 6.5% |
| r | 1872 | 6.0% |
| l | 1810 | 5.8% |
| d | 913 | 2.9% |
| Other values (14) | 5661 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 671 | |
| A | 653 | |
| B | 492 | 8.2% |
| R | 489 | 8.2% |
| S | 488 | 8.2% |
| M | 484 | 8.1% |
| T | 411 | 6.9% |
| P | 403 | 6.7% |
| D | 328 | 5.5% |
| L | 248 | 4.1% |
| Other values (10) | 1318 |
Space Separator
| Value | Count | Frequency (%) |
| 3606 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 81 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37384 | |
| Common | 3687 | 9.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3770 | 10.1% |
| s | 3530 | 9.4% |
| e | 3277 | 8.8% |
| i | 3014 | 8.1% |
| o | 2795 | 7.5% |
| n | 2724 | 7.3% |
| t | 2033 | 5.4% |
| r | 1872 | 5.0% |
| l | 1810 | 4.8% |
| d | 913 | 2.4% |
| Other values (34) | 11646 |
Common
| Value | Count | Frequency (%) |
| 3606 | ||
| . | 81 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41071 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3770 | 9.2% |
| 3606 | 8.8% | |
| s | 3530 | 8.6% |
| e | 3277 | 8.0% |
| i | 3014 | 7.3% |
| o | 2795 | 6.8% |
| n | 2724 | 6.6% |
| t | 2033 | 4.9% |
| r | 1872 | 4.6% |
| l | 1810 | 4.4% |
| Other values (36) | 12640 |
home_team_errors
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.58617886 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 1416 |
| Zeros (%) | 57.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.80581712 |
|---|---|
| Coefficient of variation (CV) | 1.3746949 |
| Kurtosis | 2.0546943 |
| Mean | 0.58617886 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4410193 |
| Sum | 1442 |
| Variance | 0.64934123 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1416 | |
| 1 | 732 | |
| 2 | 241 | 9.8% |
| 3 | 57 | 2.3% |
| 4 | 13 | 0.5% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1416 | |
| 1 | 732 | |
| 2 | 241 | 9.8% |
| 3 | 57 | 2.3% |
| 4 | 13 | 0.5% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 13 | 0.5% |
| 3 | 57 | 2.3% |
| 2 | 241 | 9.8% |
| 1 | 732 | |
| 0 | 1416 |
home_team_hits
Real number (ℝ)
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.6113821 |
| Minimum | 0 |
|---|---|
| Maximum | 22 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 6 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 15 |
| Maximum | 22 |
| Range | 22 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.4386792 |
|---|---|
| Coefficient of variation (CV) | 0.39931792 |
| Kurtosis | 0.18147362 |
| Mean | 8.6113821 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.47623304 |
| Sum | 21184 |
| Variance | 11.824515 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 294 | |
| 7 | 275 | |
| 9 | 274 | |
| 6 | 257 | |
| 10 | 236 | |
| 11 | 194 | |
| 5 | 184 | |
| 12 | 165 | |
| 4 | 151 | |
| 13 | 96 | 3.9% |
| Other values (13) | 334 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 12 | 0.5% |
| 2 | 33 | 1.3% |
| 3 | 72 | 2.9% |
| 4 | 151 | |
| 5 | 184 | |
| 6 | 257 | |
| 7 | 275 | |
| 8 | 294 | |
| 9 | 274 |
| Value | Count | Frequency (%) |
| 22 | 1 | < 0.1% |
| 21 | 3 | 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 10 | 0.4% |
| 18 | 17 | 0.7% |
| 17 | 28 | 1.1% |
| 16 | 31 | 1.3% |
| 15 | 36 | 1.5% |
| 14 | 89 | |
| 13 | 96 |
home_team_runs
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5203252 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 130 |
| Zeros (%) | 5.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 11 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.1125024 |
|---|---|
| Coefficient of variation (CV) | 0.68855719 |
| Kurtosis | 0.83058099 |
| Mean | 4.5203252 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.92009549 |
| Sum | 11120 |
| Variance | 9.6876713 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 354 | |
| 2 | 312 | |
| 4 | 305 | |
| 5 | 301 | |
| 1 | 273 | |
| 6 | 207 | |
| 7 | 190 | |
| 0 | 130 | 5.3% |
| 8 | 130 | 5.3% |
| 9 | 79 | 3.2% |
| Other values (8) | 179 |
| Value | Count | Frequency (%) |
| 0 | 130 | 5.3% |
| 1 | 273 | |
| 2 | 312 | |
| 3 | 354 | |
| 4 | 305 | |
| 5 | 301 | |
| 6 | 207 | |
| 7 | 190 | |
| 8 | 130 | 5.3% |
| 9 | 79 | 3.2% |
| Value | Count | Frequency (%) |
| 17 | 4 | 0.2% |
| 16 | 5 | 0.2% |
| 15 | 4 | 0.2% |
| 14 | 16 | 0.7% |
| 13 | 26 | 1.1% |
| 12 | 33 | 1.3% |
| 11 | 37 | 1.5% |
| 10 | 54 | |
| 9 | 79 | |
| 8 | 130 |
start_time
Categorical
| Distinct | 207 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 KiB |
| 7:10pm | |
|---|---|
| 7:11pm | |
| 7:07pm | |
| 7:08pm | 138 |
| 1:10pm | 121 |
| Other values (202) |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0512195 |
| Min length | 7 |
Characters and Unicode
| Total characters | 17346 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 79 ? |
|---|---|
| Unique (%) | 3.2% |
Sample
| 1st row | 7:38pm |
|---|---|
| 2nd row | 7:11pm |
| 3rd row | 7:07pm |
| 4th row | 7:10pm |
| 5th row | 12:40pm |
Common Values
| Value | Count | Frequency (%) |
| 7:10pm | 413 | 16.8% |
| 7:11pm | 213 | 8.7% |
| 7:07pm | 195 | 7.9% |
| 7:08pm | 138 | 5.6% |
| 1:10pm | 121 | 4.9% |
| 1:11pm | 82 | 3.3% |
| 7:09pm | 80 | 3.3% |
| 7:15pm | 66 | 2.7% |
| 6:40pm | 51 | 2.1% |
| 7:06pm | 50 | 2.0% |
| Other values (197) | 1051 |
Length
| Value | Count | Frequency (%) |
| 7:10pm | 413 | 16.8% |
| 7:11pm | 213 | 8.7% |
| 7:07pm | 195 | 7.9% |
| 7:08pm | 138 | 5.6% |
| 1:10pm | 121 | 4.9% |
| 1:11pm | 82 | 3.3% |
| 7:09pm | 80 | 3.3% |
| 7:15pm | 66 | 2.7% |
| 6:40pm | 51 | 2.1% |
| 7:06pm | 50 | 2.0% |
| Other values (197) | 1051 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2460 | ||
| : | 2460 | |
| m | 2460 | |
| p | 2458 | |
| 1 | 2401 | |
| 7 | 1678 | |
| 0 | 1476 | |
| 6 | 432 | 2.5% |
| 2 | 358 | 2.1% |
| 4 | 287 | 1.7% |
| Other values (5) | 876 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7506 | |
| Lowercase Letter | 4920 | |
| Space Separator | 2460 | 14.2% |
| Other Punctuation | 2460 | 14.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2401 | |
| 7 | 1678 | |
| 0 | 1476 | |
| 6 | 432 | 5.8% |
| 2 | 358 | 4.8% |
| 4 | 287 | 3.8% |
| 8 | 286 | 3.8% |
| 5 | 230 | 3.1% |
| 3 | 206 | 2.7% |
| 9 | 152 | 2.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 2460 | |
| p | 2458 | |
| a | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2460 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2460 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12426 | |
| Latin | 4920 | 28.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2460 | ||
| : | 2460 | |
| 1 | 2401 | |
| 7 | 1678 | |
| 0 | 1476 | |
| 6 | 432 | 3.5% |
| 2 | 358 | 2.9% |
| 4 | 287 | 2.3% |
| 8 | 286 | 2.3% |
| 5 | 230 | 1.9% |
| Other values (2) | 358 | 2.9% |
Latin
| Value | Count | Frequency (%) |
| m | 2460 | |
| p | 2458 | |
| a | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17346 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2460 | ||
| : | 2460 | |
| m | 2460 | |
| p | 2458 | |
| 1 | 2401 | |
| 7 | 1678 | |
| 0 | 1476 | |
| 6 | 432 | 2.5% |
| 2 | 358 | 2.1% |
| 4 | 287 | 1.7% |
| Other values (5) | 876 | 5.1% |
venue
Categorical
| Distinct | 31 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 KiB |
| Progressive Field | 89 |
|---|---|
| Wrigley Field | 89 |
| Dodger Stadium | 86 |
| Rogers Centre | 86 |
| Nationals Park | 84 |
| Other values (26) |
Length
| Max length | 32 |
|---|---|
| Median length | 20 |
| Mean length | 16.528049 |
| Min length | 9 |
Characters and Unicode
| Total characters | 40659 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Kauffman Stadium |
|---|---|
| 2nd row | Great American Ball Park |
| 3rd row | Oriole Park at Camden Yards |
| 4th row | Turner Field |
| 5th row | Chase Field |
Common Values
| Value | Count | Frequency (%) |
| Progressive Field | 89 | 3.6% |
| Wrigley Field | 89 | 3.6% |
| Dodger Stadium | 86 | 3.5% |
| Rogers Centre | 86 | 3.5% |
| Nationals Park | 84 | 3.4% |
| Globe Life Park in Arlington | 83 | 3.4% |
| AT&T Park | 83 | 3.4% |
| Fenway Park | 82 | 3.3% |
| Kauffman Stadium | 81 | 3.3% |
| Great American Ball Park | 81 | 3.3% |
| Other values (21) | 1616 |
Length
| Value | Count | Frequency (%) |
| park | 1059 | 17.0% |
| field | 824 | 13.2% |
| stadium | 410 | 6.6% |
| iii | 162 | 2.6% |
| progressive | 89 | 1.4% |
| wrigley | 89 | 1.4% |
| rogers | 86 | 1.4% |
| centre | 86 | 1.4% |
| dodger | 86 | 1.4% |
| nationals | 84 | 1.4% |
| Other values (42) | 3247 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6222 | ||
| a | 3661 | 9.0% |
| e | 3300 | 8.1% |
| i | 2877 | 7.1% |
| r | 2717 | 6.7% |
| l | 2212 | 5.4% |
| d | 1725 | 4.2% |
| n | 1633 | 4.0% |
| o | 1321 | 3.2% |
| t | 1312 | 3.2% |
| Other values (36) | 13679 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27325 | |
| Uppercase Letter | 6788 | 16.7% |
| Space Separator | 6222 | 15.3% |
| Other Punctuation | 243 | 0.6% |
| Dash Punctuation | 81 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3661 | |
| e | 3300 | |
| i | 2877 | |
| r | 2717 | |
| l | 2212 | |
| d | 1725 | 6.3% |
| n | 1633 | 6.0% |
| o | 1321 | 4.8% |
| t | 1312 | 4.8% |
| k | 1302 | 4.8% |
| Other values (13) | 5265 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1309 | |
| F | 907 | |
| C | 893 | |
| S | 571 | |
| A | 490 | 7.2% |
| I | 486 | 7.2% |
| T | 408 | 6.0% |
| M | 323 | 4.8% |
| B | 244 | 3.6% |
| N | 164 | 2.4% |
| Other values (9) | 993 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 160 | |
| & | 83 |
Space Separator
| Value | Count | Frequency (%) |
| 6222 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 81 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34113 | |
| Common | 6546 | 16.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3661 | 10.7% |
| e | 3300 | 9.7% |
| i | 2877 | 8.4% |
| r | 2717 | 8.0% |
| l | 2212 | 6.5% |
| d | 1725 | 5.1% |
| n | 1633 | 4.8% |
| o | 1321 | 3.9% |
| t | 1312 | 3.8% |
| P | 1309 | 3.8% |
| Other values (32) | 12046 |
Common
| Value | Count | Frequency (%) |
| 6222 | ||
| . | 160 | 2.4% |
| & | 83 | 1.3% |
| - | 81 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40659 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6222 | ||
| a | 3661 | 9.0% |
| e | 3300 | 8.1% |
| i | 2877 | 7.1% |
| r | 2717 | 6.7% |
| l | 2212 | 5.4% |
| d | 1725 | 4.2% |
| n | 1633 | 4.0% |
| o | 1321 | 3.2% |
| t | 1312 | 3.2% |
| Other values (36) | 13679 |
week_day
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 KiB |
| Saturday | |
|---|---|
| Friday | |
| Sunday | |
| Wednesday | |
| Tuesday | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.1378049 |
| Min length | 6 |
Characters and Unicode
| Total characters | 17559 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sunday |
|---|---|
| 2nd row | Wednesday |
| 3rd row | Wednesday |
| 4th row | Wednesday |
| 5th row | Wednesday |
Common Values
| Value | Count | Frequency (%) |
| Saturday | 396 | |
| Friday | 394 | |
| Sunday | 392 | |
| Wednesday | 379 | |
| Tuesday | 374 | |
| Monday | 277 | |
| Thursday | 248 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| saturday | 396 | |
| friday | 394 | |
| sunday | 392 | |
| wednesday | 379 | |
| tuesday | 374 | |
| monday | 277 | |
| thursday | 248 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2856 | |
| d | 2839 | |
| y | 2460 | |
| u | 1410 | |
| e | 1132 | 6.4% |
| n | 1048 | 6.0% |
| r | 1038 | 5.9% |
| s | 1001 | 5.7% |
| S | 788 | 4.5% |
| T | 622 | 3.5% |
| Other values (7) | 2365 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15099 | |
| Uppercase Letter | 2460 | 14.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2856 | |
| d | 2839 | |
| y | 2460 | |
| u | 1410 | |
| e | 1132 | 7.5% |
| n | 1048 | 6.9% |
| r | 1038 | 6.9% |
| s | 1001 | 6.6% |
| t | 396 | 2.6% |
| i | 394 | 2.6% |
| Other values (2) | 525 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 788 | |
| T | 622 | |
| F | 394 | |
| W | 379 | |
| M | 277 | 11.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17559 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2856 | |
| d | 2839 | |
| y | 2460 | |
| u | 1410 | |
| e | 1132 | 6.4% |
| n | 1048 | 6.0% |
| r | 1038 | 5.9% |
| s | 1001 | 5.7% |
| S | 788 | 4.5% |
| T | 622 | 3.5% |
| Other values (7) | 2365 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17559 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2856 | |
| d | 2839 | |
| y | 2460 | |
| u | 1410 | |
| e | 1132 | 6.4% |
| n | 1048 | 6.0% |
| r | 1038 | 5.9% |
| s | 1001 | 5.7% |
| S | 788 | 4.5% |
| T | 622 | 3.5% |
| Other values (7) | 2365 |
year
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 KiB |
| 2016 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 9840 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2016 |
|---|---|
| 2nd row | 2016 |
| 3rd row | 2016 |
| 4th row | 2016 |
| 5th row | 2016 |
Common Values
| Value | Count | Frequency (%) |
| 2016 | 2460 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2016 | 2460 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2460 | |
| 0 | 2460 | |
| 1 | 2460 | |
| 6 | 2460 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9840 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2460 | |
| 0 | 2460 | |
| 1 | 2460 | |
| 6 | 2460 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9840 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2460 | |
| 0 | 2460 | |
| 1 | 2460 | |
| 6 | 2460 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2460 | |
| 0 | 2460 | |
| 1 | 2460 | |
| 6 | 2460 |
fiedl_type
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 KiB |
| on grass | |
|---|---|
| on turf | 167 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.9321138 |
| Min length | 7 |
Characters and Unicode
| Total characters | 19513 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | on grass |
|---|---|
| 2nd row | on grass |
| 3rd row | on grass |
| 4th row | on grass |
| 5th row | on grass |
Common Values
| Value | Count | Frequency (%) |
| on grass | 2293 | |
| on turf | 167 | 6.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| on | 2460 | |
| grass | 2293 | |
| turf | 167 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 4586 | |
| o | 2460 | |
| n | 2460 | |
| 2460 | ||
| r | 2460 | |
| g | 2293 | |
| a | 2293 | |
| t | 167 | 0.9% |
| u | 167 | 0.9% |
| f | 167 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17053 | |
| Space Separator | 2460 | 12.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 4586 | |
| o | 2460 | |
| n | 2460 | |
| r | 2460 | |
| g | 2293 | |
| a | 2293 | |
| t | 167 | 1.0% |
| u | 167 | 1.0% |
| f | 167 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 2460 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17053 | |
| Common | 2460 | 12.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 4586 | |
| o | 2460 | |
| n | 2460 | |
| r | 2460 | |
| g | 2293 | |
| a | 2293 | |
| t | 167 | 1.0% |
| u | 167 | 1.0% |
| f | 167 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 2460 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19513 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 4586 | |
| o | 2460 | |
| n | 2460 | |
| 2460 | ||
| r | 2460 | |
| g | 2293 | |
| a | 2293 | |
| t | 167 | 0.9% |
| u | 167 | 0.9% |
| f | 167 | 0.9% |
| attendance | away_team_errors | away_team_hits | away_team_runs | game_duration | home_team_errors | home_team_hits | home_team_runs | away_team | game_type | home_team | venue | week_day | fiedl_type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| attendance | 1.000 | 0.018 | -0.042 | -0.049 | 0.050 | -0.018 | 0.001 | 0.024 | 0.119 | 0.123 | 0.425 | 0.426 | 0.130 | 0.402 |
| away_team_errors | 0.018 | 1.000 | 0.033 | 0.047 | 0.131 | 0.005 | 0.145 | 0.206 | 0.033 | 0.000 | 0.000 | 0.000 | 0.028 | 0.000 |
| away_team_hits | -0.042 | 0.033 | 1.000 | 0.759 | 0.487 | 0.166 | 0.101 | 0.046 | 0.000 | 0.000 | 0.067 | 0.067 | 0.000 | 0.000 |
| away_team_runs | -0.049 | 0.047 | 0.759 | 1.000 | 0.458 | 0.257 | 0.087 | 0.035 | 0.030 | 0.000 | 0.064 | 0.062 | 0.000 | 0.000 |
| game_duration | 0.050 | 0.131 | 0.487 | 0.458 | 1.000 | 0.152 | 0.348 | 0.246 | 0.009 | 0.000 | 0.024 | 0.000 | 0.000 | 0.000 |
| home_team_errors | -0.018 | 0.005 | 0.166 | 0.257 | 0.152 | 1.000 | -0.019 | -0.010 | 0.036 | 0.000 | 0.049 | 0.046 | 0.000 | 0.000 |
| home_team_hits | 0.001 | 0.145 | 0.101 | 0.087 | 0.348 | -0.019 | 1.000 | 0.747 | 0.040 | 0.000 | 0.060 | 0.059 | 0.021 | 0.000 |
| home_team_runs | 0.024 | 0.206 | 0.046 | 0.035 | 0.246 | -0.010 | 0.747 | 1.000 | 0.008 | 0.000 | 0.038 | 0.034 | 0.000 | 0.000 |
| away_team | 0.119 | 0.033 | 0.000 | 0.030 | 0.009 | 0.036 | 0.040 | 0.008 | 1.000 | 0.000 | 0.179 | 0.179 | 0.000 | 0.251 |
| game_type | 0.123 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.098 | 0.097 | 0.617 | 0.021 |
| home_team | 0.425 | 0.000 | 0.067 | 0.064 | 0.024 | 0.049 | 0.060 | 0.038 | 0.179 | 0.098 | 1.000 | 1.000 | 0.000 | 0.994 |
| venue | 0.426 | 0.000 | 0.067 | 0.062 | 0.000 | 0.046 | 0.059 | 0.034 | 0.179 | 0.097 | 1.000 | 1.000 | 0.000 | 0.994 |
| week_day | 0.130 | 0.028 | 0.000 | 0.000 | 0.000 | 0.000 | 0.021 | 0.000 | 0.000 | 0.617 | 0.000 | 0.000 | 1.000 | 0.000 |
| fiedl_type | 0.402 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.251 | 0.021 | 0.994 | 0.994 | 0.000 | 1.000 |
| attendance | away_team | away_team_errors | away_team_hits | away_team_runs | date | game_duration | game_type | home_team | home_team_errors | home_team_hits | home_team_runs | start_time | venue | week_day | year | fiedl_type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 40030.0 | New York Mets | 1 | 7 | 3 | April 3 | 313 | Night Game | Kansas City Royals | 0 | 9 | 4 | 7:38pm | Kauffman Stadium | Sunday | 2016 | on grass |
| 1 | 21621.0 | Philadelphia Phillies | 0 | 5 | 2 | April 6 | 223 | Night Game | Cincinnati Reds | 0 | 8 | 3 | 7:11pm | Great American Ball Park | Wednesday | 2016 | on grass |
| 2 | 12622.0 | Minnesota Twins | 0 | 5 | 2 | April 6 | 311 | Night Game | Baltimore Orioles | 0 | 9 | 4 | 7:07pm | Oriole Park at Camden Yards | Wednesday | 2016 | on grass |
| 3 | 18531.0 | Washington Nationals | 0 | 8 | 3 | April 6 | 253 | Night Game | Atlanta Braves | 1 | 8 | 1 | 7:10pm | Turner Field | Wednesday | 2016 | on grass |
| 4 | 18572.0 | Colorado Rockies | 1 | 8 | 4 | April 6 | 239 | Day Game | Arizona Diamondbacks | 0 | 8 | 3 | 12:40pm | Chase Field | Wednesday | 2016 | on grass |
| 5 | 28386.0 | Seattle Mariners | 1 | 11 | 10 | April 5 | 330 | Night Game | Texas Rangers | 1 | 7 | 2 | 7:07pm | Globe Life Park in Arlington | Tuesday | 2016 | on grass |
| 6 | 12757.0 | Toronto Blue Jays | 0 | 9 | 2 | April 5 | 307 | Night Game | Tampa Bay Rays | 1 | 7 | 3 | 7:10pm | Tropicana Field | Tuesday | 2016 | on turf |
| 7 | 28329.0 | Los Angeles Dodgers | 0 | 6 | 3 | April 5 | 236 | Night Game | San Diego Padres | 1 | 2 | 0 | 7:11pm | Petco Park | Tuesday | 2016 | on grass |
| 8 | 26049.0 | St. Louis Cardinals | 1 | 8 | 5 | April 5 | 327 | Night Game | Pittsburgh Pirates | 2 | 12 | 6 | 7:08pm | PNC Park | Tuesday | 2016 | on grass |
| 9 | 10478.0 | Chicago White Sox | 0 | 11 | 5 | April 5 | 328 | Night Game | Oakland Athletics | 0 | 10 | 4 | 7:08pm | Oakland-Alameda County Coliseum | Tuesday | 2016 | on grass |
| attendance | away_team | away_team_errors | away_team_hits | away_team_runs | date | game_duration | game_type | home_team | home_team_errors | home_team_hits | home_team_runs | start_time | venue | week_day | year | fiedl_type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2453 | 43683.0 | Philadelphia Phillies | 2 | 6 | 2 | April 4 | 256 | Day Game | Cincinnati Reds | 0 | 6 | 6 | 4:11pm | Great American Ball Park | Monday | 2016 | on grass |
| 2454 | 45785.0 | Minnesota Twins | 0 | 7 | 2 | April 4 | 248 | Day Game | Baltimore Orioles | 0 | 10 | 3 | 4:46pm | Oriole Park at Camden Yards | Monday | 2016 | on grass |
| 2455 | 48282.0 | Washington Nationals | 0 | 8 | 4 | April 4 | 323 | Day Game | Atlanta Braves | 2 | 4 | 3 | 4:13pm | Turner Field | Monday | 2016 | on grass |
| 2456 | 48165.0 | Colorado Rockies | 0 | 15 | 10 | April 4 | 411 | Night Game | Arizona Diamondbacks | 0 | 12 | 5 | 6:42pm | Chase Field | Monday | 2016 | on grass |
| 2457 | 44020.0 | Chicago Cubs | 0 | 11 | 9 | April 4 | 308 | Night Game | Los Angeles Angels of Anaheim | 1 | 3 | 0 | 7:08pm | Angel Stadium of Anaheim | Monday | 2016 | on grass |
| 2458 | 31042.0 | Toronto Blue Jays | 2 | 7 | 5 | April 3 | 251 | Day Game | Tampa Bay Rays | 1 | 7 | 3 | 4:09pm | Tropicana Field | Sunday | 2016 | on turf |
| 2459 | 39500.0 | St. Louis Cardinals | 0 | 5 | 1 | April 3 | 302 | Day Game | Pittsburgh Pirates | 1 | 9 | 4 | 1:15pm | PNC Park | Sunday | 2016 | on grass |
| 2460 | 20098.0 | San Francisco Giants | 0 | 6 | 3 | April 6 | 319 | Day Game | Milwaukee Brewers | 2 | 9 | 4 | 12:41pm | Miller Park | Wednesday | 2016 | on grass |
| 2461 | 17883.0 | Detroit Tigers | 0 | 13 | 7 | April 6 | 322 | Day Game | Miami Marlins | 1 | 10 | 3 | 4:57pm | Marlins Park | Wednesday | 2016 | on grass |
| 2462 | 10298.0 | Boston Red Sox | 1 | 10 | 6 | April 6 | 329 | Night Game | Cleveland Indians | 0 | 9 | 7 | 6:22pm | Progressive Field | Wednesday | 2016 | on grass |